STRUCTURE OF LARGE INCOMPLETE SETS IN ABELIAN 

GROUPS 

VAN H. VU 



Abstract. Let G be a finite abelian group and A be a subset of G. We 
say that A is complete if every element of G can be represented as a sum of 
different elements of A. In this paper, we study the following question: 

What is the structure of a large incomplete set ? 

The typical answer is that such a set is essentially contained in a maximal 
subgroup. As a by-product, we obtain a new proof for several earlier results. 



1. Introduction 



Let G be an abelian group and A be a subset of G. We use Sa to denote the 
collection of all subset sums of A 

S A ■■= {J2 a \ B c A >\ B \ <0 °}- 

aeB 

We will keep this notation when A is sequence of (not necessarily different) elements 
of A. In this case Sa is the collection of all subsequence sums of A. Z n denotes the 
cyclic group of order n. 

Example. Take G = Z lx . If A is the subset {1,2,3}, then S A = {1,2,3,4,5,6}. If 
A is the sequence {1, 1, 3}, then Sa = {1> 2, 3, 4, 5}. 

Following Erdos [1], we say that A is complete if Sa = G and incomplete otherwise. 
If G is finite, the critical number of G, c(G), is the smallest integer m such that 
any subset A C G\{0} with size m is complete. This parameter has been studied 
for a long time and its exact value is known for most groups. 

Theorem 1.1. Let G be a finite abelian group of order n — ph, where p is the 
smallest prime divisor of G. Then the following holds 



• If p = 2 and h > 5 or G — Z 2 ® Z 2 ® Z 2 , then c(G) — h. If p — 2 and 
h<A and G ^ Z 2 © Z 2 © Z 2 , then c(G) =h + l. 

• If h is a prime, then p+h — 2 < c(G) < p+h — 1. Furthermore, if h — p > 3 
or h > 2p + 1, then c(G) =p + h-2. 

• If P> 3 and h is composite, then c{G) = p + h — 2. 



V. Vu is an A. Sloan Fellow and is supported by an NSF Career Grant. 

1 



2 



VAN H. VU 



The first statement is due to Didcrrich and Mann [7] . The second combines results 
of Mann and Wou [10] ( who studied the case h = p) and Didderich [6]. The last 
statement has been known as Didderich conjecture, posed in [6] and was proved by 
Gao and Hamidoune [8], more than twenty years later. 

In this paper, we would like to study the following question 

What is the structure of a relatively large incomplete set ? 

Technically speaking, we would like to have a characterization for incomplete sets 
of relatively large size. Such a characterization has been obtained recently in [9] 
for sets of size at least n/(p + 2). In this paper, we will be able to treat much 
smaller sets. (In fact, our assumption on "relatively large" is almost sharp; see 
Theorem 2.9.) The method used in our proofs is different from those used in 
previous papers. As a by-product, one obtains a new proof for a good portion of 
Theorem 1.1, including a new proof for Didderich's conjecture for large n (see the 
remarks following Theorem 2.7). 

Notation. < A > denotes the subgroup generated by A. E(X) denotes the expec- 
tation of a random variable X. All logarithms have natural base, if not specified 
otherwise. 



2. The characterization of large incomplete sets 

Let us start by a simple fact, whose proof is left as an exercise. 

Fact 2.1. Let p be a prime and A be a sequence of p — 1 non-zero elements in Z p . 
Then Sa n {0} = Z p . On the other hand, there is a sequence of p — 2 non-zero 
elements of Z p such that Sa n {0} ^ Z p . 

Let G be an abelian group of size n and q be a prime divisor of n. Let H be a 
subgroup of size n/q. A direct corollary of Fact 2.1 is the following 

Fact 2.2. If A is an incomplete subset of G and Sadh = H, then H\A has at 
most q — 2 elements. Consequently A has at most n/q + q — 2 elements. 

Definition 2.3. Let G be an abelian group of size n. A subset A of G is nice if 
there is a subgroup H of G such that \G/H\ is a prime and Sadh = H. 

Given a subgroup H in G and an element a e G, we use a/H to represent the coset 
of H which contains a. a/H can be viewed as an element of the quotient group 
G/H. If B is a subset of G, then B/H := {b/H\b e B} is a sequence in G/H. 

Fact 2.4. If A is a nice incomplete set in a finite abelian group G of size n, then 
I -A | < - p L +p — 2, where p is the smallest prime divisor ofn. Furthermore, (A\H)/H 
is an incomplete sequence in Z q = G/H. 
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Proof (Proof of Fact 2.4) If A is a nice incomplete set then \A\ < ^ + q — 2, for 
some prime divisor q of n. On the other hand, it is easy to see that ^ + q < j + p, 
where p is the smallest prime divisor of n. ■ 

Our leading idea is that relatively large incomplete sets are nice. A special case 
has been verified by Gao, Hamidoune, Llado and Serra [9]. Their theorem can be 
reformulated in the current setting as follows 

Theorem 2.5. Let G be an abelian group of order n — ph, where p > 5 is the 
smallest prime divisor of n, h> 15p is composite. Let A be an incomplete subset 
of at least + p elements. Then A is nice. Furthermore, there is a subgroup H 
of size n/p such that Sadh = H. 

For any positive e < 1, define 



and let n(e) be the smallest integer m such that for any n> m 

4 

n > C(e) v nlogn > 

e z 

Remark 2.6. n(e) is relatively small. One can take, say, n(e) = 500/e 4 . 



Now we are ready to state our first theorem. 

Theorem 2.7. Let 8 be a positive constant at most 1/6 andp\ < p2 ■ ■ ■ < pt, t > 2, 

be primes satisfying three conditions 



• P2 > 3; 

• Pi < ^yV"/ lo S"- 



Let G be an abelian group of order n and A be an incomplete subset of G of size at 
least (5/6 + 5)-^. Then A is nice and there is a subgroup H such that n/\H\ is one 
of the pi, \A\H\ < 3pi and Sahh = H . 

Remark 2.8. Let us have a few comments on this theorem. 



• Using Theorem 2.7 and Facts 2.2, we can recover a large portion of Theorem 

1.1. To see this, consider an incomplete set A which does not contain zero. 
If \A\ < n/pi, there is nothing to prove. If \A\ > n/pi, and n — \G\ 
satisfies the assumptions of Theorem 2.7, apply this theorem to obtain the 
subgroup H. As A does not contain zero, then |^4niJ| < \H\ — 1. By Facts 

2.2, \A\ < n/q + q — 3, where q = n/\H\. But n/q + q < n/pi + pi, so 
\A\ <n/pi+pi-3. 

• The third assumption p\ < j^yn/ log n in Theorem 2.7 can be voided if 
we assume t > 3 (i.e., n/pi is composite) and n sufficiently large. In that 
case pi < n 1 / 3 <C \Jnj logra. It follows that the assumptions of Theorem 
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2.7 arc satisfied whenever p\ > 3, h is composite and n is sumciently large. 
Thus, we have a new proof of Diddcrich conjecture for sufficiently large n. 
• Unlike Theorem 2.5, one cannot conclude that H has size n/p\. It is easy 
to give examples where |G|/|i?| can be any of the pi. 

The next question is to find the best lower bound on \A\ that guarantees niceness. 
Our second theorem gives an almost complete answer for this question. 

Theorem 2.9. For any positive constant S there is a positive constant D(S) such 
that the following holds. Let pi < p2 ■ ■ ■ < Pt, t > 3, be primes such that p\P2 < 
T^jyVW logn 7 where n := n!=iP»- Let G be an abelian group of order n and A 
be an incomplete subset of G with cardinality at least (1 + S)^^. Then A is nice. 

Furthermore, the lower bound (1 + 5)^^ cannot be replaced by + n 1 / 4- ", for 
any constant a. 

Finally, let us discuss the case when G = Z n , where n is a prime. This case has not 
been covered by the results presented so far. Olson [13], improving upon a result of 
Erdos and Hcilbronn [2], shows that c(Z n ) < \/4n — 3 + 1. His bound was improved 
by da Silva and Hamidoune [3] to ^JAn — 7. As far as characterization results are 
concerned, we know of the following two results. 

Theorem 2.10. Let n be a prime and A be an incomplete subset ofL n of size at 
least (2n) 1 / 2 . Then there is some non-zero element b € Z p such that 

IHI < « + 0(n 3 / 4 logn). 

a£bA 

Theorem 2.11. Let n be a prime and A be an incomplete subset of Z„ of size at 
least 1.997T. 1 / 2 . Then there is some non-zero element b e Z p such that 

IHI <n + C(n 1/2 ). 

aGbA 

Theorem 2.10 is due to Deshouillcrs and Freiman [5]. Theorem 2.11 is due to 
Nguyen, Szemeredi and Vu [11]. The error term in this 0(n 1//2 ) is best possible, as 
shown by a construction in [4] . 

The rest of the paper is organized as follows. Section 3 contains the main lemma to 
the proofs, which states that if A is sufficiently large, then Sa contains a subgroup 
of size comparable to \A\. The proofs of the theorems come in Sections 4 and 5. 
Section 6 is devoted to concluding remarks. 



3. The existence of a large subgroup in Sa 

Our key tool is the following statement, which asserts that if A is a sufficiently 
large subset of G, then Sa contains a large subgroup of G. Recall the definition of 
C(e) and n(e) from (1) and (2). 
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Theorem 3.1. Let < e < 1 be a constant and G be an abelian group of size n, 

where n> max{|-, C(e)\/nlogn}. Let A be a subset of G with at least max{ ^ , C(e) y/n log n} 

elements. Then Sa contains a subgroup of size at least (1 — e)\A\. 

Remark 3.2. The bound (1 — c)\A\ is asymptotically sharp, as A itself can be a 
subgroup. The lower bound C\Jn logn for \A\ is sharp up to the logarithmic term. 
To see this, consider G = Z p 2 and A = {0, 1 . . . ,p}. It is clear that \A\ > p = \/\G\. 
On the other hand, Sa does not contain any proper subgroup of G. It is interesting 
to see whether the log term can be removed. 

Remark 3.3. The theorem also holds for non-abelian group, see Theorem 3.9 at the 
end of this section. 

By definition of n(e), if n > n(e) then 

4 

n > C(e) y' nlogn > —. 

In this case we have the following corollary, which is easier to use. 

Corollary 3.4. Let < e < 1 be a constant and G be an ablian group of size 
n > n(e). Let A be a subset of G with at least C(e)\/n\ogn elements. Then Sa 
contains a subgroup of size at least (1 — e)\A\. 

To prove Theorem 3.1, we use the following result of Olson [12] (see also [15, 
Chapter 12]). Let A be a set and I be a positive integer, we define 

IA := {ai + . ..a t \ai G A}. 

Also recall that < A > denotes the subgroup generated by A. 

Theorem 3.5. Let G be finite abelian group, I be a positive integer and G A be 
a finite subset ofG. Then either IA=< A> or \IA\ > \A\ + (I - 1)(^ + !)• 

Since \ A\ + (I — 1)(^ + 1) > (/ + l)|A|/2, the following corollary is immediate. 

Corollary 3.6. Let G be a finite abelian group, I be a positive integer and G A 
be a finite subset of G such that (7 + l)| J 4|>2|G|, then 

IA=<A>. 

We also needs the following result of Olson [12], which refines an earlier result of 
Szemeredi [14] (Szemeredi proved the theorem for an unspecified constant instead 
of 3). 

Theorem 3.7. Let G be an abelian group of order n and A be subset of at least 
3^/n elements. Then G Sa- 

We also need the following simple lemma: 

Lemma 3.8. Let G be an abelian group and A be a subset of G. Let I be a 

positive integer and S a subset of G such that every element of S can be represented 
as the sum of two different elements of A in at least 21 — 1 ways ( not counting 
permutations). Then IS C Sa. 
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Proof (Proof of Lemma 3.8) Let x\,...x\ be (not necessarily different) elements 

of S. We represent x\ H h xi as a sum of different elements of A using the greedy 

algorithm. To start, represent x\ = a\ + a[ where a\ ^ a[ are different elements of 
A. Assume that we have represented x\ = a\ + a[, . . . , Xi = a^ + a-, where 1 < i < I 
and oi, a[, . . . , a i: a- are all different. Now look at x i+i . Each of the 2i elements 
oi, a[, . . . , di, a'i appear in at most one representation of x i+i . Since Xi+\ has at 
least 21 — 1 > 2i representations, we can find a representation x i+ i = a i+ i + a' i+1 
where both a i+ i and a' i+1 are different from a\, a[, . . . , a i; a-. This concludes the 
prof. ■ 



Proof (Proof of Theorem 3.1) For each element x e G, let m x be the number 
of ways to represent x as the sum of two different elements of A (not counting 
permutations). A double counting argument gives 

S> = ('?)• (3) 

Notice that m x is at most M :— \A\/2. Set K := 2/e. Let Sj be the collection 
of those x where K~iM < m x < K~i +1 M for j = l,...,jo, where jo is the 
largest integer such that K~^°M > 1. Let Sj 0+ i be the collection of those x where 
1 < m x < K- jo M. By the definition of Sj 

J'o + l 

'£ / K-*+ 1 M\S j \>'E i m x , (4) 

j=i xea 



which, together with (3) imply 

Jo+l 



J2K-^M\ Sj \>( l f) (5) 



Call a set Sj small (j = 1, . . . , jo + 1) if it has at most (1 — e)\A\ elements and large 
otherwise. The contribution from the small Sj on the left hand side is at most 

£ K-^M(1 e)\A\ < ^-M(l - e)\A\ = (1 - _f_)l4! 

taken into account the facts that K = 2/e and M = \ A\/2. Since \A\ > 4r, we have 

(1— !-)li!!<(l-e/2)i^-M. 
1 2-e 2 ~ K ' ' 2 2 

From this and (5), we have 



£ A— M ra >(lf)- (1 - e / 2 ,^ + M = £ W 



(6) 



Sj large 



The bound \A\ > C{e)^Jn logn guarantees that 

|A' 2 
1 



e^>5tfnlog 2/e n. (7) 
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(In fact, C(e) is defined so that this inequality holds.) Set lj := K~iM. Since the 
number of large Sj is at most jo + 1 < U°g2/c l-^l/^J + 1, (6), (7) and the pigeon 
hole principle imply that there is a large Sj such that 

l 3 \S 3 \>4n. 

Notice that \Sj\ < \G\ = n. It follows that ([lj/2\ + l)\Sj\ > 2n. Apply Corollary 
3.4 to I := [lj/2\ and S := Sj U {0}, we can conclude that IS =< S >. On the 
other hand, by the definition of S 

IS = U l i=1 iSj U {0}. 

By Lemma 3.8, 

iSj C Sa, 

for all 1 < % < I. Finally, G Sa by Olson's theorem. Thus Sa contains < A >, 
which has at least (1 — e)\A\ elements since Sj is large and | < S > | > \S\ > \Sj\. 
This concludes the proof. ■ 

All the tools used in the proof (Theorems 3.5 and 3.7, Lemma 3.8) hold for non- 
abelian groups. Thus, Theorem 3.1 also holds for this case. The proof requires only 
two simple modifications. First, in Lemma 3.8, 2/ — 1 is replaced by 41 — 3. The 
reason is that in the proof, each of the elements oi, a'i, . . . , a%, a\ can now appear 
in at most 2 representations of x i+1 . The second is that in the proof of Theorem 
3.1, we need to fix an ordering on the elements of G and when we consider a sum 
x + y, we always assume that x precedes y in this ordering. The rest of the proof 
remains the same. 

Theorem 3.9. For any constant < e < 1 there are constant n\{e) and C\(e) such 
that the following holds. Let G be a group of size n, where n > ni(e). Let A be a 
subset of G with at least C\(e)\/n logn} elements. Then Sa contains a subgroup of 
size at least (1 — e)\A\. 

The values of rai(e) and Ci(e) might be slightly different from that of n(e) and 
C(e), due to the modifications. 

4. Proof of Theorem 2.7 

Lemma 4.1. Let G be a finite additive group and A be a subset ofG with cardinality 
at least [\G\/2\ + 2. Then S A = G. 

Proof (Proof of Lemma 4.1) Let x be an arbitrary element of G. There are exactly 
[|G|/2J (unordered) pairs (a, b) of different elements of G such that a + b = x. The 
claim follows by the pigeon hole principle. One can improve the bound slightly but 
from our point of view it is not important. ■ 

Proof (Proof of Theorem 2.7) Let A\ be an arbitrary subset of A with cardinality 
(1 + 2<5)^-. By the upper bound on pi, we can assume that \A\\ > C{8)\fn\ogn, 



8 



VAN H. VU 



which enables us to apply Corollary 3.1 to A\ and obtain a subgroup H C Sa-l 
where 

Ti T) 

\H\>(l-5)\A 1 \ = (l-5)(l + 25)—>—. 

ipi 3pi 

The assumption p 2 > 3 shows that H > It follows that \H\ = n/q where q is 

one of the pi ( 1 < i < t). Furthermore, 

q < 3pi. 

Consider the sequence B := {a/H\a e j4\^4i} in the quotient group G/H = Z q . If 
B has at least q— 1 non-zero elements, then by Fact 2.1 Sb contains Z q \{0}, which 
implies that 

G c SUi + SU\Ai c Sa 
a contradiction as A is incomplete. Thus, B has at most q — 2 non-zero elements. 
So we can conclude that all but at most q — 2 elements of A\Ai lie in i7. Let A 2 
denote the set of these elements. We have 

\A 2 \ > \A\A 1 \-(q-2) > |A| -3pi + 2> (| + <5 - (1 + 25)^j j - 3pi + 2. 

The right most formula is 

(J + ^))--3pi + 2>^-+2. 
2 3 pi 2pi 

since > 3pi by the assumption p\ < 3( }^ \Jn/ logn and the definition of 

C(5). 

On the other hand, \H\ is at most Thus, |^4 2 | > \H\/2 + 2 and so by Lemma 
4.1, 5' J 4 2 = H. Notice that A 2 C H n A. Thus SUnff = # which means that A is 
nice, completing the proof. ■ 



5. Proof of Theorem 2.9 



Without loss of generality, we can assume that 5 < 1/2 and A has exactly (1+5) 
elements. Let A\ be a subset of A of size (1 + 5/2) Setting D{8) sufficiently 
large, one can assume that n is sufficiently large and \A\ \ > C(5/A)^n\ogn (where 
C is defined as in (1)), thanks to the assumption 

P1P2 < j^\/nJ\ogn. 

This enables us to apply Corollary 3.4 to A\ and conclude that Sa x contains a 
subgroup H of size at least 

n n 

(1 - tf/4)|Ai| = (1 - <J/4)(1 + 5/2) > . 

PlP2 P\P2 

The critical point here is that \H\ is larger than This forces \H\ = n/q where 

q is one of the primes p%. It would be easy to finish the proof now if A had at least 
(2 + 5)j^ (instead of only (1 + <5)^j) elements. The reason is that in this case 
we still have (1 + 5/2)^^ elements outside A\ to play with. Arguing as in the 
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previous proof, we can show that most of these elements should be in H and span 
it. As we lack these extra elements, we need an additional trick that helps us to 
show that actually most elements of A\ are already in H. The heart of this trick 
is Lemma 5.1 below. Before presenting the lemma, let us make some observations. 
Set A 2 := A\A\. As Ai was chosen arbitrarily, A2 is an arbitrary subset of A with 
2 p™ pi elements. Since A is incomplete, |A2\i?| < q — 2, where q = |G|/|Zf | < pip 2 - 
By setting D{8) sufficiently large, we can assume 

PlP2 < ™ = 777 ^2 

20pip 2 10 



which implies 

\A 2 r\H\ > \A 2 \ - PlP2 > (1-S/IQ)\A 2 \. 

To summarize, A has the property that for any subset A 2 of size 2 p™ P2 = 2(1+6) 1^1' 
there is a maximal subgroup H of G such that I.A2 H if | > (1 — <5/10)j A2I- 

Lemma 5.1. Let S be a subset of G of size (1 + S)— — such that no maximal 

j j \ ' P1P2 

subgroup of G contains (1 — 8/2) fraction of S. Then there is a subset S 1 C S of 
size 2(1+5) \S\ suc h ^at no maximal subgroup of G contains (1 — 8/10) fraction of 
S'. 



Assuming the lemma for a moment, we can conclude the proof as follows. By the 
lemma and its preceding paragraph, there is a maximal subgroup H such that 

71 71 

\H n A\ > (1 - 8/2)\A\ = (1 - S/2)(l + 8) > (1 + 6/4) 

PlP2 P\P2 

as 8 < 1/2. Since \H\ < n/pi and the smallest prime divisor p' of H is either p\ or 
P2, it is easy to verify that 

\HnA\> 15 + P '. 
p 

Thus we can apply Theorem 1.1 or Theorem 2.7 for H and A n H to deduce that 
A n H is complete in H . Therefore, Sahh = H and A is nice. 

Now we prove Lemma 5.1, using a probabilistic argument. Proof (Proof of 
Lemma 5.1) Set s := \S\ = (1 + 8)^^ and e := 8/10. Consider a random subset 
Si of S obtained by selecting each element a € S to be in Si with probability 
p := (1 + 2e) 2 (i+sy i ' independently. Let H be a subgroup of G. By linearity of 
expectation and the assumption of the lemma , we have 

E(|ff n Si |) = p\H ^S\< P (l - 6/2)s = p(l - 5e)s. 

On the other hand, E(|5i|) = ps. Both H n Si and Si have binomial distribution. 
By property of the binomial distribution, there is a positive constant cq depending 
only on e such that with probability at least 1 — exp(— c ps) 

(l-e)ps<\S 1 \<(l + e)ps. (8) 

and 



1-ffnSil < (l + e)p(l - 5e)s. 



(9) 
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It is well known (and easy to prove) that the number of maximal subgroups of G 
is at most \G\ = n. If D(6) (and so n) is sufficiently large, then 

2n < exp(cops). 

Thus, we can use the union bound to conclude that there exists a set Si such that 
(8) holds and (9) holds simultaneously for every maximal subgroup H. Let S' be 
any subset of Si of size 2 n™n 2 ~ JT2eP s - ^ 0T any maximal subgroup H 

\S' n H\/\S'\ < \Si n H\/\S'\ < (1+ / £)( 1 1 ^ 5£)PS < (1 - e) - (1 - 5/10). 

This concludes the proof of the lemma. ■ 



The following example shows that the lower bound (1 + <5)^^ cannot be reduced 
to + n 1 / 4- ", for any fixed a. 

Example. Take n := p 2 q where 1 < p < q are large primes. Consider G = Z p i®Z q . 
Given any 5 > and any function D(S), by choosing q properly p we can guarantee 
that 

D{6) 

We write an element a € G as a = (x, y) where x € Z p 2 and y G Z q . Let m be the 
largest integer such that Y^iLo i <p 2 — t- Set 

A := {(x, 0)|0 < x < m} U {{0,y)\0 <y<q-l}. 

It is easy to show that A is incomplete and not nice, thanks to the fact that 
Y^iLo i < P 2 — ^- On the other hand, 

\A\ = m + q = m+ — = mA > n 1/4 + 



P 2 P\P2 P\Pl 



The proof of the theorem is complete. 



6. Concluding remarks 



One can use the additional trick in the proof of Theorem 2.9 to improve upon the 
constant (5/6 + S) in Theorem 2.7. However, this requires some modification on 
the assumptions. We prefer to present Theorem 2.7 in the simplest way in order to 
illustrate the ideas. 

One can also use the method presented here to study incomplete sets with size less 
than —2— . However, the characterization obtained in this case is more technical 
and less appealing. 
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